18:00
2026-06-23
research.ibm.com
artificial-intelligence
Running AI on mixed hardware for speed and affordability
IBM Research, Red Hat, and NxtGen Cloud Technologies demonstrated that using llm-d to serve AI models on mixed GPU hardware can boost inference speeds by 3 to 5 times and double throughput, enabling e…